User interface language: English | Español

HL Paper 3

The random variables X , Y follow a bivariate normal distribution with product moment correlation coefficient ρ.

A random sample of 11 observations on X, Y was obtained and the value of the sample product moment correlation coefficient, r, was calculated to be −0.708.

The covariance of the random variables U, V is defined by

Cov(U, V) = E((U − E(U))(V − E(V))).

State suitable hypotheses to investigate whether or not a negative linear association exists between X and Y.

[1]
a.

Determine the p-value.

[3]
b.i.

State your conclusion at the 1 % significance level.

[1]
b.ii.

Show that Cov(U, V) = E(UV) − E(U)E(V).

[3]
c.i.

Hence show that if U, V are independent random variables then the population product moment correlation coefficient, ρ, is zero.

[3]
c.ii.

Markscheme

* This question is from an exam for a previous syllabus, and may contain minor differences in marking or structure.

H0 : ρ = 0; H1 ρ < 0       A1

[1 mark]

a.

t = 0.708 11 2 1 ( 0.708 ) 2 = ( 3.0075 )        (M1)

degrees of freedom = 9        (A1)

P(T < −3.0075...) = 0.00739       A1

Note: Accept any answer that rounds to 0.0074.

[3 marks]

b.i.

reject H0 or equivalent statement       R1

Note: Apply follow through on the candidate’s p-value.

[1 mark]

b.ii.

Cov(U, V) + E((U − E(U))(V − E(V)))

= E(UV − E(U)V − E(V)+ E(U)E(V))       M1

= E(UV) − E(E(U)V) − E(E(V)U) + E(E(U)E(V))       (A1)

= E(UV) − E(U)E(V) − E(V)E(U) + E(U)E(V)       A1

Cov(U, V) = E(UV) − E(U)E(V)       AG

[3 marks]

c.i.

E(UV) = E(U)E(V) (independent random variables)       R1

⇒Cov(U, V) = E(U)E(V) − E(U)E(V) = 0      A1

hence, ρ =  Cov ( U , V ) Var ( U ) Var ( V ) = 0      A1AG

Note: Accept the statement that Cov(U,V) is the numerator of the formula for ρ.

Note: Only award the first A1 if the R1 is awarded.

[3 marks]

c.ii.

Examiners report

[N/A]
a.
[N/A]
b.i.
[N/A]
b.ii.
[N/A]
c.i.
[N/A]
c.ii.



Peter, the Principal of a college, believes that there is an association between the score in a Mathematics test, X , and the time taken to run 500 m, Y seconds, of his students. The following paired data are collected.

It can be assumed that ( X Y ) follow a bivariate normal distribution with product moment correlation coefficient ρ .

State suitable hypotheses H 0 and H 1 to test Peter’s claim, using a two-tailed test.

[1]
a.i.

Carry out a suitable test at the 5 % significance level. With reference to the  p -value, state your conclusion in the context of Peter’s claim.

[4]
a.ii.

Peter uses the regression line of y on x as y = 0.248 x + 83.0 and calculates that a student with a Mathematics test score of 73 will have a running time of 101 seconds. Comment on the validity of his calculation.

[2]
b.

Markscheme

H 0 : ρ = 0     H 1 : ρ 0        A1

Note: It must be ρ .

[1 mark]

a.i.

p = 0.649        A2

Note: Accept anything that rounds to 0.65

0.649 > 0.05        R1

hence, we accept  H 0 and conclude that Peter’s claim is wrong         A1

Note: The A mark depends on the R mark and the answer must be given in context. Follow through the p -value in part (b).

[4 marks]

a.ii.

a statement along along the lines of ‘(we have accepted that) the two variables are independent’ or ‘the two variables are weakly correlated’       R1

a statement along the lines of ‘the use of the regression line is invalid’ or ‘it would give an inaccurate result’       R1

Note: Award the second R1 only if the first R1 is awarded.

Note: FT the conclusion in(a)(ii). If a candidate concludes that the claim is correct, mark as follows: (as we have accepted H1) the 2 variables are dependent and 73 lies in the range of x values R1, hence the use of the regression line is valid R1

[2 marks]

b.

Examiners report

[N/A]
a.i.
[N/A]
a.ii.
[N/A]
b.